Decision Tree Modeling with Relational Views

نویسندگان

  • Fadila Bentayeb
  • Jérôme Darmont
چکیده

Data mining is a useful decision support technique that can be used to discover production rules in warehouses or corporate data. Data mining research has made much effort to apply various mining algorithms efficiently on large databases. However, a serious problem in their practical application is the long processing time of such algorithms. Nowadays, one of the key challenges is to integrate data mining methods within the framework of traditional database systems. Indeed, such implementations can take advantage of the efficiency provided by SQL engines. In this paper, we propose an integrating approach for decision trees within a classical database system. In other words, we try to discover knowledge from relational databases, in the form of production rules, via a procedure embedding SQL queries. The obtained decision tree is defined by successive, related relational views. Each view corresponds to a given population in the underlying decision tree. We selected the classical Induction Decision Tree (ID3) algorithm to build the decision tree. To prove that our implementation of ID3 works properly, we successfully compared the output of our procedure with the output of an existing and validated data mining software, SIPINA. Furthermore, since our approach is tuneable, it can be generalized to any other similar decision tree-based method.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presenting a Model to Assess Organizational Performance Based on the Concept of Knowledge Management Using Regression Model, Decision Tree, Gray Relational Analysis and DEMATEL Method (Case Study: National Library and Archives of Iran)

Many organizations have recognized that knowledge is the most important resource in today’s economy. With regards to knowledge-based views of the firm, organizations are actively embracing knowledge management with the expectation of acquiring and maintaining high levels of organizational performance. The relationship between knowledge management (KM) and organizational performance has been the...

متن کامل

Presenting a Model to Assess Organizational Performance Based on the Concept of Knowledge Management Using Regression Model, Decision Tree, Gray Relational Analysis and DEMATEL Method (Case Study: National Library and Archives of Iran)

Many organizations have recognized that knowledge is the most important resource in today’s economy. With regards to knowledge-based views of the firm, organizations are actively embracing knowledge management with the expectation of acquiring and maintaining high levels of organizational performance. The relationship between knowledge management (KM) and organizational performance has been the...

متن کامل

Modeling and development of a decision support system for supplier selection in the process industry

This paper presents the development of a model based decision support system with a case study on solving the supplier selection problem in a chemical processing industry. For the evaluation and selection of supplier, the analytical hierarchy process (AHP) and grey relational analysis (GRA) were used. The intention of the study is to propose an appropriate platform for process industries in ...

متن کامل

Extended SQL Aggregation for Database Transformation

To prepare a normalized data set from relational database for analysis requires significant efforts and it is time consuming task. The main reason is that, in general the database grows with many tables and views that must be joined, aggregated and transformed in order to build the required data set. As result, most of the SQL queries are written independently multiple times and in disorganize ...

متن کامل

Ranking stocks of listed companies on Tehran stock exchange using a hybrid model of decision tree and logistic regression

Much research has introduced linear or nonlinear models using statistical models and machine learning tools in artificial intelligence to estimate Iran's rate of return. The primary purpose of these methods is simultaneously use different independent variables to improve stock return rates' modeling. However, in predicting the rate of return, in addition to the modeling method, the degree of co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002